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DETECTION OF SINGLE NUCLEOTIDE POLYMORPHISMS 



This application claims benefit of U.S. Provisional Patent Application 
Serial No. 60/179,844, filed on February 2, 2000. 

FIELD OF THE INVENTION 

5 The present invention relates to the detection of single nucleotide 

polymorphisms. 

BACKGROUND OF THE INVENTION 

Single-nucleotide polymorphisms (SNPs) are the most frequent type of 
variation in the human genome with an estimated frequency of one to two 

10 polymorphic nucleotides per kilobase (Schafer et al., Nat Biotechnol 16: 33-9 (1998); 
Brookes, Gene 234: 177-86 (1999)). SNPs can serve as genetic markers for 
identifying disease genes by linkage studies in families, linkage disequilibrium in 
isolated populations, association analysis of patients and controls, and loss-of- 
heterozygosity studies in tumors (Wang et al, Science 280: 1077-82 (1998)). 

15 Although some SNPs in single genes are associated with heritable diseases such as 
cystic fibrosis, sickle cell anemia, colorectal cancer, and retinitis pigmentosa (Kerem 
et al, Science 245: 1073-80 (1989); Fearon et al, Cell 61: 759-67 (1990); Sung et al, 
Proc Natl Acad Sci U S A 88: 6481-5 (1991)), most SNPs are "silent". They can alter 
phenotype by either controlling the splicing together of exon from intron-containing 

20 genes or changing the way mRNA folds. Recently, there has been increasing 

knowledge of the genetic basis of SNPs for individual differences in drug response 
(McCarthy et al., Nat Biotechnol 18: 505-8 (2000); Roses, Nature 405: 857-65 
(2000)). Insights into differences between alleles or mutations present in different 
individuals can also illuminate the interplay of environment with disease 

25 susceptibility. For example, in the p5i tumor suppressor gene, over 400 mutations 
have been found to be associated with tumors and used to determine individuals with 
increased cancer risk (Kurian et al., J Pathol 187: 267-71 (1999)). All these 
applications involve the analysis of a large number of samples and will eventually 
require rapid, inexpensive, and highly automated methods for genotyping analysis. 
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Because of the importance of identifying SNPs, a number of gel-based 
methods have been described for their detection and genotyping. These methods 
include single strand conformational polymorphism analysis, heteroduplex analysis, 
denaturing gradient gel electrophoresis, and chemical or enzyme mismatch 
5 modification assays (Schafer and Hawkins, Nat Biotechnol 16: 33-9 (1998)). To 
facilitate large-scale SNP identification, new technologies are being developed to 
replace the conventional gel-based re-sequencing methods. Perhaps the most widely 
employed techniques currently used for SNP identification are array hybridization 
assays, such as allele specific oligonucleotide microarrays in miniaturized assays 

10 (Wang, Fan et al., Science 280: 1077-82 (1998)). This approach relies on the capacity 
to distinguish a perfect match from a single base mismatch by hybridization of target 
DNA to a related set of four groups of oligonucleotides that are identical except for 
the base centrally located in the oligonucleotide. Mismatches in the central base of 
the oligonucleotide sequence have a greater destabilizing effect than mispairing at 

1 5 distal positions during hybridization. Thus, this strategy developed by Affymetrix 
utilizes a set of four oligonucleotides for each base to re-sequence. For example, a 
10-kb gene requires a microarray of 40,000 oligos that can be accomplished by on- 
chip photolithographic synthesis (Ramsay, Nat Biotechnol 16: 40-4 (1998)). The 
mutation detection is based on the development of a two-color labeling scheme, in 

20 which the reference DNA is labeled with phycoerythrin (red) during the PCR 
amplification, while the target DNA is labeled with fluorescein (green). Both 
reference and target samples can then be hybridized in parallel to separate chips with 
identically synthesized arrays or co-hybridized to the same chip. The signal of 
hybridization of fluorescent products is recorded through confocal microscopy. 

25 Comparison of the images for a target sample and reference sample can yield the 

genotype of the target sample for thousands of SNPs being tested. By processing co- 
hybridization of the reference and target samples together, experimental variability 
during the subsequent fragmentation, hybridization, washing, and detection steps can 
be minimized to make array hybridization more reproducible. The interpretation of 

30 the result is based on the ratios between the hybridization signals from the reference 
and the target DNA with each probe (Hacia et al, Nat Genet 14: 441-7 (1996)). 



Despite the impressive technology that is emerging for the 
hybridization to oligonucleotide arrays, potential problems with these approaches 
exist due to several factors. One limiting factor originates from the inherent 
properties of the nucleic acid hybridization. The efficiency of hybridization and 
5 thermal stability of hybrids formed between the target DNA and a short 

oligonucleotide probe depend strongly on the nucleotide sequence of the probe and 
the stringency of the reaction conditions. Furthermore, the degree of destabilization 
of the hybrid molecule by a mismatched base at one position is dependent on the 
flanking nucleotide sequence. As a result, it would be difficult to design a single set 

10 of hybridization conditions that would provide optimal signal intensities and 
discrimination of a large number of sequence variants simultaneously. This is 
particularly true for human genomic DNA which is present either in heterozygous or 
homozygous form. In addition, the necessity of using DNA chips composed of tens 
of oligonucleotide probes per analyzed nucleotide position has led to a complex setup 

1 5 of assays and requires mathematical algorithms for interpretation of the data. 

Another popular method for high-throughput SNP analysis is called 5' 
exonuclease assay in which two fluorogenic probes, double-labeled with a fluorescent 
reporter dye (FAM or TET) and a quencher dye (TAMRA) are included in a typical 
PGR amplification (Lee et al., Nucleic Acids Res 21 : 3761-6 (1993); Morin et al., 

20 Biotechniques 27: 538-40, 542, 544 passim (1999)). During PCR, the allele-specific 
probes are cleaved by the 5' exonuclease activity of Tag DNA polymerase, only if 
they are perfectly annealed to the segment being amplified. Cleavage of the probes 
generates an increase in the fluorescence intensity of the reporter dye. As a result, 
both report fluorescence that can be plotted and segregated to determine the template 

25 genotype. The advantage of this approach is to virtually eliminate post-PCR 

processing. However, the apparent drawbacks of this technique relate to the time and 
expense of establishing and optimizing conditions for each locus. 

Another widely accepted method to identify SNPs is called single 
nucleotide primer extension (SNuPE), also known as mini sequencing (Nikiforov et 

30 al., Nucleic Acids Res 22: 4167-75 (1994); Pastinen et al., Clin Chem 42: 1391-17 

(1996); Landegren et al., Genome Res 8: 769-76 (1998)). This technique involves the 
hybridization of a primer immediately adjacent to the polymorphic locus, extension 
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by a single dideoxynucleoti.de, and identification of the extended primer. An 
advantage of this approach, compared to hybridization with oligonucleotide probes, is 
that all variable nucleotides are identified with optimal discrimination using the same 
reaction conditions. Consequently, at least one order of magnitude of higher power 
5 for discriminating between genotyping is available using this method than with 
hybridization of allele-specific oligonucleotide probes in the same array format 
(Pastinen et al., Genome Res 7: 606-14 (1997)). 

Since the first introduction of SNuPE for the identification of genetic 
disease (Kuppuswamy et al, Proc Natl Acad Sci U S A 88: 1 143-7 (1991)), several 

10 new detection methods have been developed including luminous detection (Nyren et 
al, Anal Biochem 208: 171-5 (1993)), colorimetric ELISA (Nikiforov et al., Nucleic 
Acids Res 22: 4167-75 (1994)), gel-based fluorescent assays (Pastinen et al., Clin 
Chem 42: 1391-7 (1996)), homogeneous fluorescent detection (Chen et al., Genet 
Anal 14: 157-63 (1999)), flow cytometry-based assays (Cai et al., Genomics 66: 135- 

15 43 (2000)), HPLC analysis (Hoogendoorn et al., Hum Genet 104: 89-93 (1999)). 
Recently, a combination of single nucleotide primer extension and matrix assisted 
laser desorption ionization-time of flight mass spectrometry (MALDI-TOFMS) 
detection has been developed (Haff et al, Genome Res 7: 378-88 (1997); Griffin et 
al., Trends Biotechnol 18: 77-84 (2000); Sauer et al., Nucleic Acids Res 28: E13 

20 (2000)). This approach allows the determination of SNP sequences by measuring the 
mass difference between the known primer mass and the extended primer mass using 
MALDI-TOFMS. Discrimination of mass differences of less than 1 part in 1,000 is 
required to determine which of the four dideoxynucleotide triphosphate bases 
(ddNTPs), dideoxy-cytidine triphosphate (ddCTP), dideoxy-thymidine triphosphate 

25 (ddTTP), dideoxy-adenosine triphosphate (ddATP), and dideoxy-guanosine 

triphosphate (ddGTP) reacted to extend the primer. A desired capability of this 
technique includes the analysis of heterozygotes where two different bases are present 
at the same nucleotide position. The MALDI-TOFMS measurement requires the 
discrimination of two mass-resolved species that represent the addition of both bases 

30 complementary to those at the SNP site. This requires MALDI-TOFMS methods 
incorporating high mass resolution capabilities and enhanced sensitivity. Compared 
to the detection of a fluorescence-labeled nucleotide by non-mass spectrometric 



methods, mass detection is faster, and less laborious without the need for modified or 
labeled bases. Mass detection offers advantages in accuracy, specificity, and 
sensitivity. Recently, a chip-based primer extension combined with mass 
spectrometry detection for genotyping was performed on a 1-uL scale in the wells 
5 contained within a microchip without using conventional sample tubes and microtiter 
plates (Tang et al., Proc Natl Acad Sci U S A 96: 10016-20 (1999)). This 
miniaturized method clearly provides another potential for high-throughput and low 
cost identification of genetic variations. 

Current methods exist for the identification of SNPs using electrospray 

10 for the mass detection of the extended primers. These methods are similar to 

MALDI-TOFMS in that mass measurements to within 1 part in 1,000 are required to 
discriminate which base extended the oligonucleotide primer. Also, electrospray 
ionization of large oligonucleotides is difficult, requiring someone highly skilled in 
the interpretation of the data. 

1 5 As SNPs are used in applications such as gene location, drug resistance 

testing, disease diagnosis, and identity testing, a concomitant increase in the rate of 
routine SNP characterization will be necessary. Pooling of DNA from ten to 
thousands of individuals into one sample before genotyping is a valuable means of 
streamlining large-scale SNP genotyping in disease association studies. The results 

20 from pooling are interpreted as a representation of the allele frequency distribution in 
the individual samples and can be used to validate a candidate SNP as common or 
rare or merely detect the presence of a particular variation in the pooled DNA sample. 
Quantitation of small molecules by electrospray ionization is well known to provide 
high sensitivity and linear responses over 3-4 orders of magnitude. The electrospray 

25 ionization/mass spectrometry procedure, in accordance with the present invention, can 
be used to accurately quantify small molecules for SNP genotyping and can provide 
an advantage when analyzing pooled DNA samples for the purpose of determining 
SNP allele frequencies. 

The present invention is a single base DNA variation detection method 

30 which overcomes the above-noted deficiencies in prior techniques. 
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SUMMARY OF THE INVENTION 

The present invention relates to a method of detecting single nucleotide 
polymorphisms by providing a target nucleic acid molecule, an oligonucleotide 
primer complementary to a portion of the target nucleic acid molecule, a nucleic acid 
5 polymerizing enzyme, and a plurality of types of nucleotide analogs. The target 
nucleic acid molecule, the oligonucleotide primer, the nucleic acid polymerizing 
enzyme, and the nucleotide analogs, each type being present in a first amount, are 
blended to form an extension solution where the oligonucleotide primer is hybridized 
to the target nucleic acid molecule to form a primed target nucleic acid molecule and 

10 the nucleic acid polymerizing enzyme is positioned to add nucleotide analogs to the 
primed target nucleic acid molecule at an active site. The oligonucleotide primer in 
the extension solution is extended by using the nucleic acid polymerizing enzyme to 
add a nucleotide analog to the oligonucleotide primer at the active site. This forms an 
extended oligonucleotide primer where the nucleotide analog being added at the 

1 5 active site is complementary to the nucleotide of the target nucleic acid molecule. 

The amounts of each type of the nucleotide analogs in the extension solution after the 
extending step are then determined where each type is present in a second amount. 
The first and second amounts of each type of the nucleotide analog are compared. 
The type of nucleotide analog where the first and second amounts differ as the 

20 nucleotide added to the oligonucleotide primer at the active site is then identified. As 
a result, the nucleotide consumed in the primer extension reaction is determined. 

Another aspect of the present invention relates to an electrospray 
system. This system includes an electrospray device which comprises a substrate 
having an injection surface and an ejection surface opposing the injection surface. 

25 The substrate is an integral monolith having an entrance orifice on the injection 
surface, an exit orifice on the ejection surface, a channel extending between the 
entrance orifice and the exit orifice, and a recess extending into the ejection surface 
and surrounding the exit orifice to define a nozzle on the ejection surface. The 
electrospray system also includes a sample preparation device positioned to transfer 

30 fluids to the electrospray device where the sample preparation device comprises a 
liquid passage and a metal chelating resin positioned to treat fluids passing through 
the liquid passage. 
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A further aspect of the present invention relates to an electrospray 
system. This system includes an electrospray device which comprises a substrate 
having an injection surface and an ejection surface opposing the injection surface. 
The substrate is an integral monolith having an entrance orifice on the injection 
5 surface, an exit orifice on the ejection surface, a channel extending between the 
entrance orifice and the exit orifice, and a recess extending into the ejection surface 
and surrounding the exit orifice to define a nozzle on the ejection surface. The 
electrospray system also includes a sample preparation device positioned to transfer 
fluids to the electrospray device where the sample preparation device comprises a 

1 0 liquid passage and a molecular weight filter positioned to treat fluids passing through 
the liquid passage. 

Yet another aspect of the present invention is directed to a reagent 
composition which includes an aqueous carrier, an oligonucleotide primer, a mixture 
of nucleotide analogs of different types, magnesium acetate, a buffer, and a nucleic 

1 5 acid polymerizing enzyme. The oligonucleotide primer is present in the reaction 

mixture in molar excess while the concentration of ddNTPs is limited. In general the 
primer concentration is four times greater than that of each ddNTP. 

Detection of the unreacted or free solution concentrations of the four 
ddNTPs offers many advantages over systems and methods described in the prior art. 

20 One of the main advantages is that by detecting the relative concentrations of the free 
ddNTPs in solution, any single-nucleotide polymorphism can be identified by only 
quantifying these four compounds. This greatly simplifies the detection technology 
required to identify SNPs. 

Another advantage of the present invention is that it permits the use of 

25 double-stranded DNA. As a result, there is no need to isolate and separate single- 
stranded DNA. Since the process of the present invention can be carried out in 
solution with free primers (i.e. primers not immobilized on a solid support), improved 
reaction kinetics are achieved. 

The present invention eliminates the complexity associated with other 

30 SNP genotyping methods described in the prior art by providing a novel primer 
extension reaction coupled with electrospray ionization (ESI)/mass spectrometry 
(MS) analysis. Nucleotide sequence variations are determined using PCR amplified 
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double- stranded DNA without the need to use modified PCR primers and to separate 
and isolate single-stranded DNA. There is no requirement for complex tagging of 
primer extension nucleotides or nucleotide bases with, for example, radioisotope 
labels or fluorescent analogs. By quantifying the unreacted ddNTPs after primer 
5 extension reactions, the present invention improves the selectivity and sensitivity of 
prior disclosed electrospray mass spectrometry systems for the detection of SNPs. 
This integrates high-throughput sample preparation and analysis using primer 
extension reactions coupled with mass spectrometry detection. The significant 
demands evolving from the modern pharmacogenetics field and the growing 

10 accumulation of identified SNPs in databases requires a much faster, accurate, 

sensitive, and effective analytical technique to identify SNPs of individuals for drug 
development. As a result failed drug development efforts can be revived, patient 
populations can be stratified, and target genes validated. The present invention will 
facilitate drug development and drug discovery in the pharmaceutical industry and 

1 5 also be useful in other important fields such as clinical and forensic science. 

Another advantage of the method of the present invention is that all 
extension reactions take place in solution phase without the requirement of 
immobilizing either the target DNA or SNP primer to a surface prior to or during 
primer extension. This can be achieved with great flexibility in the type of DNA 

20 being analyzed. More particularly, either single-stranded DNA or double-stranded 
DNA can be used without the need for a modified PCR primer to isolate a single- 
stranded DNA after PCR amplification. 

A further advantage of the present invention is the use of electrospray 
mass spectrometry for the detection of these four nucleotide analogs independent of 

25 the target nucleic acid under evaluation. Mass spectrometry methods are very specific 
and sensitive when detecting low molecular weight molecules. The instrument and 
detection method may be setup to monitor four unique ion response channels, one for 
each nucleotide analog, to screen any target nucleic acid. The electrospray mass 
spectrometry method will provide for nanomolar detection sensitivity (Poon, 

30 Electrospray Ionization Mass Spectrometry pp. 499-525 (1997), which is hereby 

incorporated by reference), thus providing a rapid, selective and sensitive method for 
SNP detection. 
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The present invention can identify homozygous and heterozygous 
SNPs in the same experiment. Particularly in heterozygous cases, two bases would be 
near-equally reduced in concentration, while the other two bases remain unchanged in 
concentration. The method described in the present invention shows that each base- 
5 reduced mixture provides proportionally reduced signal intensity for the 

corresponding base with relatively unchanged intensity for the unreacted bases. 

The extended reaction mixture, being directly analyzed by electrospray 
mass spectrometry, does not require complex sample preparation procedures required 
by other mass spectrometry-based detection methods described in the prior art, 

1 0 namely MALDI-TOFMS analysis (Haff et al., Genome Res 7: 378-88 (1997) and 
Griffin et al., Trends Biotechnol 18: 77-84 (2000), which are hereby incorporated by 
reference). The present invention decreases potential interference from suppression 
components in the extension reaction. In addition, the data analysis is less 
complicated due to the detection of the same four low molecular weight molecules for 

1 5 any SNP compared to detection of large oligonucleotides of varying composition 
using MALDI-TOFMS described in the prior art. 

The microchip-based electrospray device of the present invention 
provides minimal extra-column dispersion as a result of a reduction in the extra- 
column volume and provides efficient, reproducible, reliable, and rugged formation of 

20 an electrospray. This electrospray device is perfectly suited as a means of 

electrospray of fluids from microchip-based separation devices. The design of this 
electrospray device is also robust such that the device can be readily mass-produced 
in a cost-effective, high-yielding process. 

The present invention requires only one step of sample cleanup 

25 through solid phase extraction that can be miniaturized and automated by 96/3 84- well 
platform technology. 

BRIEF DESCRIPTION OF THE DRAWINGS 

30 Figure 1 A is a schematic drawing showing the detection of simple 

nucleotide polymorphisms in accordance with the present invention. Figures 1B-D 
show plots of relative ion intensity versus mass spectrum response. 
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Figure 2A shows a cross-sectional view of a two-nozzle electrospray 
device generating one electrospray plume from each nozzle for one fluid stream. 
Figure 2B shows a cross-sectional view of a two-nozzle electrospray device 
generating 2 electrospray plumes from each nozzle for one fluid stream. 
5 Figures 3 A-C show devices for detecting single nucleotide 

polymorphisms according to the present invention. Figure 3 A shows a reaction well 
block for performing a reaction, such as polymerase chain reaction and primer 
extension. Figure 3B shows an electrospray system which includes both the reaction 
well block of Figure 3 A together with an electrospray device. Figure 3C depicts an 

10 electrospray device with individual wells to which fluid is separately provided by a 
movable fluid delivery probe. 

Figure 4 shows an electrospray mass spectrum of ddNTPs. 
Figures 5A-D show the product ion mass spectra of the (M-PO3H2)" 
ions of (A) ddCTP, (B) ddTTP, (C) ddATP, and (D) ddGTP. 

1 5 Figures 6A-B are SRM MS/MS mass spectra for the (M-H)" ions 

collisionally dissociated to the common product ion m/z 159 and for the (M- H2PO3)" 
ions collisionally dissociated to the common product ion m/z 79, respectively. 

Figures 7A-D show an electrospray mass spectrum of a solution 
containing 1 uM ddNTPs with the ion intensities being normalized to the same value 

20 for comparison of the ion intensity dependence on the presence or absence of 

magnesium from the solution on the electrospray mass spectral data. In the mass 
spectra, the pseudomolecular ions, (M-H)", of ddCTP, ddTTP, ddATP, and ddGTP 
appear at m/z 450, 465, 474, and 490, respectively. In addition, the (M-PO3H2)" ions 
for each of the bases, ddCTP, ddTTP, ddATP, and ddGTP, appear at m/z 370, 385, 

25 394, and 410, respectively. Figure 7A shows the mass spectrum of a solution 

containing 1 uM ddNTPs in the presence of magnesium. Figure 7B shows the mass 
spectrum of a solution containing 1 uM ddNTPs with the magnesium removed using a 
metal chelating resin. Figure 7C depicts the mass spectrum of a solution containing 
1 uM ddNTPs with no added magnesium and eluted through a metal chelating resin. 

30 Figure 7D shows the mass spectrum of a solution containing 1 uM ddNTPs with no 
added magnesium (control) and not eluted through a metal chelating resin. 
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Figures 8 A-E show the SRM MS/MS mass spectra of the remaining 
free ddNTPs following primer extension reactions with varying SNP primer 
concentrations. 

Figure 9 shows the sequence of the synthetic templates (SEQ. ID. Nos. 
5 1-4) and SNP primer (SEQ. ID. No. 5) used in detecting single nucleotide 

polymorphisms in accordance with the present invention. This gene is the partial lad 
gene in pUC18, with 9 bases upstream (5') from the start codon of the lacZ gene. 

Figures 1 OA-E show the SRM MS/MS mass spectra of the remaining 
free ddNTPs following primer extension reactions which used synthetic single- 

1 0 stranded DNA as templates. 

Figures 1 1 A-E show the SRM MS/MS mass spectra- of the remaining 
free ddNTPs following primer extension reactions. These samples represent a 
duplicate set to those shown in Figures 10 A-E. The peak area ratio data for both sets 
of samples are provided in Table 2. 

1 5 Figure 12 shows the results from experimental work testing 

heterozygous cases where two polymorphic bases were present. The heterogeneous 
templates (equal molar mixture of two different single-stranded DNA templates) were 
used as targets in the primer extension reactions. All six possible combinations of 
heterogeneous templates were designed, and the ddNTPs expected to be consumed in 

20 the primer extension reaction for each set of templates are indicated. The templates 
and SNP primer were the same as in Figure 9. 

Figures 13A-G show the SRM MS/MS mass spectra of the remaining 
free ddNTPs following primer extension reactions which contained a mixture of two 
synthetic single- stranded DNA templates. 

25 Figure 14 shows the sequence of a 384bp PCR product of partial pheA 

gene (SEQ. ID. No. 6) by regular PCR amplification with a mutagenic primer, 
W338Ipd primer (SEQ. ID. No. 7), as forward primer, #1224 primer (SEQ. ID. No. 
8) as reverse primer, and pJS 1 as a template. The pJSl plasmid was constructed as 
described previously (Zhang et al., J Biol Chem 273: 6248-53 (1998), which is hereby 

30 incorporated by reference). The sequence of the 384bp double-stranded PCR product 
as well as all amplification primers and polymorphism detection primers (SEQ. ID. 
Nos. 7-12) are shown. The mutagenic bases in each primer are italicized, and the 
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bases mismatched to 384bp DNA are underlined. For each primer, the primer binding 
site to one or the other strand of the target DNA sequence is indicated by a line, and 
the direction of DNA synthesis is indicated by an arrow. The polymorphic bases for 
each detection primer are shown, and the complementary bases in the target sequence 
5 for each detection primer are shown in bold. 

Figures 15 A-E show the SRM MS/MS mass spectra of the remaining 
free ddNTPs following extension reactions using a 384bp double-stranded DNA PGR 
product as template. 

Figures 16 A-E show SRM MS/MS mass spectra of the remaining free 
1 0 ddNTPs following PCR extension reactions. These samples represent a duplicate set 
to those shown in Figure 1 5 A-E. 

Figure 17 shows a 384bp PCR product of partial pheA gene (SEQ. ID. 
No. 13) with a C374A mutation which was obtained by regular PCR amplification 
with a mutagenic primer, W338Ipd primer (SEQ. ID. No. 7), as forward primer, 
15 #1224 primer (SEQ. ID. No. 8) as reverse primer, and pSZ87 plasmid as a template 
(Pohnert et al., Biochemistry 38: 12212-7 (1999), which is hereby incorporated by 
reference). The primers are identified in Figure 14. 

Figures 1 8 A-D show the SRM MS/MS mass spectra of the remaining 
free ddNTPs following extension reactions relating to the pheA gene with the T366pd 
20 primer (SEQ. ID. No. 1 1), as described with respect to Figures 14 and 17. 

Figures 19 A-D show the SRM MS/MS mass spectra of the remaining 
free ddNTPs following extension reactions relating to the pheA gene with the V383pu 
primer (SEQ. ID. No. 12), as described with respect to Figures 14 and 17. 

Figures 20 A-B show electrospray ionization/mass spectrometry 
25 ("ESI/MS")-based primer extension genotyping dependence on single-stranded 

(Figure 20A) and double-stranded (Figure 20B) DNA template concentrations and 
cycle numbers. The reactions were performed at various concentrations of the 
synthetic single-stranded template A (SEQ. ID. No. 1) (Figure 20 A) or the 384bp 
double-stranded template (SEQ. ID. No. 6) (Figure 20B) with various thermal cycles. 
30 The other reaction reagents remained constant as described. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a method of detecting single nucleotide 
polymorphisms by providing a target nucleic acid molecule, an oligonucleotide 
primer complementary to a portion of the target nucleic acid molecule, a nucleic acid 
5 polymerizing enzyme, and a plurality of types of nucleotide analogs. The target 
nucleic acid molecule, the oligonucleotide primer, the nucleic acid polymerizing 
enzyme, and the nucleotide analogs, each type being present in a first amount, are 
blended to form an extension solution where the oligonucleotide primer is hybridized 
to the target nucleic acid molecule to form a primed target nucleic acid molecule and 

1 0 the nucleic acid polymerizing enzyme is positioned to add nucleotide analogs to the 
primed target nucleic acid molecule at an active site. The oligonucleotide primer in 
the extension solution is extended by using the nucleic acid polymerizing enzyme to 
add a nucleotide analog to the oligonucleotide primer at the active site. This forms an 
extended oligonucleotide primer where the nucleotide analog being added is 

1 5 complementary to the nucleotide of the target nucleic acid molecule at the active site. 
The amounts of each type of the nucleotide analogs in the extension solution after the 
extending step are then determined where each type is present in a second amount. 
The first and second amounts of each type of the nucleotide analog are compared. 
The type of nucleotide analog where the first and second amounts differ as the 

20 nucleotide added to the oligonucleotide primer is then identified. As a result, the 
nucleotide at the active site of the target nucleic acid molecule is determined. 

Figure 1 A is a schematic drawing showing the detection of single 
nucleotide polymorphisms in accordance with the present invention. After a sample 
is subjected to PCR amplification to increase the quantity of target nucleic acid 

25 molecule available to be detected, the PCR product is blended in Step 1 with a SNP 
primer complementary to a portion of the target nucleic acid sequence, an equimolar 
mixture of four nucleotide analogs (i.e. dideoxynucleotide triphosphates (ddNTPs), 
ddCTP, ddTTP, ddATP, and ddGTP), a DNA polymerase, and other reagents to form 
the extension solution. For example, as shown in Figure 1 , the extension solution 

30 may contain 5-50 nM of PCR product, 3-4 uM of SNP primer, 1 uM each of the 

ddATP, ddCTP, ddGTP, and ddTTP nucleotide analogs, 20 mM NH 4 Ac buffer at a 
pH of 8.7, 2 mM Mg(Ac) 2 , and 1 unit of DNA polymerase. A single nucleotide 
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analog is added to the primers that are specifically designed to anneal to the target 
region of the PCR amplified genomic DNA fragment. Once formed, the extension 
solution is subjected to 15 to 20 cycles to permit the base added to the 3' end of the 
SNP primer to be that which is complementary to the corresponding base in the target 
5 nucleotide. The amplified DNA template covers the known SNP variations that are 
located immediately at the 3 ' end of the annealing primers. 

The dideoxynucleotide base(s) complementary to the SNP base(s) is 
substantially consumed (removed) from the solution during this reaction. For 
homozygous SNPs, only one base is substantially consumed whereas for 

1 0 heterozygous SNPs, two bases are essentially consumed equally during the thermal 
cycle extension reaction. In Figure 1 A, the base in the target nucleic acid sequence 
which is susceptible to a single nucleotide polymorphism is either a T or a G. After 
the primer is extended by one base, as noted above, the extension solution is passed 
through a metal chelating resin to remove any magnesium from the solution in Step 2. 

1 5 The complementary base which is added to the primer is then determined by passing 
the extension solution as well as a control sample through an electrospray device and 
subjecting the electrospray to mass spectroscopy, as set forth in Step 3. 

This procedure can be used to quantify the concentrations of unreacted 
ddNTPs remaining in each sample. The advantage of this method is the simplified 

20 analysis of the same four analytes used for all possible SNPs. Quantification of free 
ddNTPs after SNP primer extension reactions may be made by several approaches 
including but not limited to fluorescence, ion conductivity, liquid chromatography, 
capillary electrophoresis, mass spectrometry, nuclear magnetic resonance, 
colorimetric ELISA, immuno -radio activity (IRA), radioactivity, or any combination 

25 thereof. Measurement of the unreacted nucleotide analog concentrations remaining in 
the reagent solution after primer extension relative to those in a control experiment 
allows for the immediate determination of the complementary base of the target DNA 
immediately adjacent to the 3' end of the oligonucleotide primer. 

Preferably, as shown in Step 3, using mass spectroscopy, the relative 

30 ion intensity for each of the nucleotide analogs is determined for each sample. By 
comparing the relative ion intensity of the extension solution and the control sample, 
the complementary base can be determined. In particular, that base is the base present 
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in the extension solution in an amount which is less than that present in the control 
sample. As shown in Figure IB, the control sample has equal relative intensities for 
each of the nucleotide analogs. When the sample is homozygous for the target 
nucleic acid sequence with a T at the polymorphism site, the relative intensity for the 
5 complementary base, A, is lower than for the other nucleotide analogs, as shown in 
Figure 1C. On the other hand, when the sample is heterozygous for the target nucleic 
acid sequence with a T and G at the polymorphism site, the relative intensity for the 
complementary bases, A and C, respectively, is lower than for the other nucleotide 
analogs, as shown in Figure ID. 

10 In carrying out the method of the present invention, genomic DNA can 

be extracted from whole blood, buccal epithelial cells, and saliva stain samples which 
are extracted by an alkaline method (Sweet et al., Forensic Sci Int 83: 167-77 (1996); 
Lin et al., Biotechniques 24: 937-40 (1998); Rudbeck et al., Biotechniques 25: 588- 
90, 592 (1998), which are hereby incorporated by reference). For blood, 5 uL of 

1 5 blood with 20 uL 0.2 M NaOH are incubated at room temperature for 5 min. For an 
air-dried mouth swab, a proportion of the cotton is transferred to a tube, 20 \xL of 0.2 
M NaOH are added, and incubation is carried out at 75 °C for 10 min. This extraction 
procedure is carried out by adding 180 uL 0.04 M Tris-HCl pH7.5. 5 of the above 
solution is sufficient for a subsequent 50 uL PCR reaction. 

20 PCR products are made from the target DNA by subjecting 50 uL PCR 

samples to treatment using an Expand PCR kit from Boehringer. The reaction 
mixture can contain 0.2mM dNTPs, 0.5 uM forward and reverse primers, and 20- 
100 ng of genomic DNA as the template. The PCR procedure may be conducted at 
95 °C for 1 min, 55 °C for 1 min, and 72 °C for 30 sec for 30-35 PCR cycles. The 

25 resulting PCR products are directly purified using a QIAGEN micro-column or 

Millipore Microcon-50 filter unit and further used for the later primer extension step. 

The reaction mixtures for primer extension can contain 3-4 uM SNP 
primer, 1 uM dideoxynucleotides (ddNTPs), and 50 nM synthetic single-stranded 
DNA or double- stranded PCR product as the target sequence. A reaction buffer (e.g., 

30 25mM ammonium acetate pH 9.3) with 2 mM magnesium acetate and 1 unit of 
Thermosequenase may be used for the primer extension reaction. The reaction 
mixture (10-50 uL) can be thermally cycled at 95 °C for 30 sec, 50 °C for 60 sec, and 
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72 °C for 10 sec for 20 cycles in a GeneAmp PCR System 9700 instrument. This 
solution-based assay is readily amenable to miniaturization. 

The extension reaction samples are preferably passed through a micro 
metal chelating gel column (e.g., immobilized iminodiacetic acid gel from PIERCE) 
5 to remove magnesium from the reaction mixture. The resulting samples then can be 
either directly used for MS analysis or evaporated and reconstituted into distilled 
water for electrospray mass spectrometry detection of the four ddNTPs. 

The electrospray/mass spectrometery procedure is carried out so that 
the samples are analyzed in the negative ion mode. Selected reaction monitoring 

1 0 ("SRM") mass spectrometry/mass spectrometry ("MS/MS") experiments monitor 
unique precursor-product ion transitions for each ddNTP. For ddCTP, the SRM 
transition is either m/z 450 -> m/z 159 or m/z 370 -> m/z 79. For ddTTP, the SRM 
transition is either m/z 465 -» m/z 159 or m/z 385 -> m/z 79. For ddATP, the SRM 
transition is either m/z 474 -> m/z 159 or m/z 394 -> m/z 79. For ddGTP, the SRM 

1 5 transition is either m/z 490 —> m/z 159 or m/z 410 -» m/z 79. The relative 

concentration of the ddNTPs in each sample is compared to a non-extended reaction 
control. The base(s) complementary to the consumed ddNTPs during the primer 
extension reaction can be assigned as the SNP base for both homozygous and 
heterozygous alleles based upon the relative ion responses of each of the four 

20 ddNTPs. 

Nucleotide analogs which are useful in carrying out the present 
invention by serving as substrate molecules for the nucleic acid polymerizing enzyme 
include dNTPs, NTPs, modified dNTPs or NTPs, peptide nucleotides, modified 
peptide nucleotides, or modified phosphate-sugar backbone nucleotides. 

25 The process of the present invention can be used to determine the 

single nucleotide variations of any nucleic acid molecule, including RNA, double- 
stranded or single- stranded DNA, single stranded DNA hairpins, DNA/RNA hybrids, 
RNA with a recognition site for binding of the polymerase, or RNA hairpins. 

The oligonucleotide primer used in carrying out the process of the 

30 present invention can be a ribonucleotide, deoxyribonucleotide, modified 

ribonucleotide, modified deoxyribonucleotide, peptide nucleic acid, modified peptide 
nucleic acid, modified phosphate-sugar backbone oligonucleotide, and other 



- 17- 



nucleotide and oligonucleotide analogs. It can be either synthetic or produced 
naturally by primases, RNA polymerases, or other oligonucleotide synthesizing 
enzymes. 

The nucleic acid polymerizing enzyme utilized in accordance with the 
5 present invention can be either DNA polymerases, RNA polymerases, or reverse 
transcriptases. Suitable polymerases are thermostable polymerases or thermally 
degradable polymerases. Examples of suitable thermostable polymerases include 
polymerases isolated from Thermus aquaticus, Thermus thermophilus, Pyrococcus 
woesei, Pyrococcus furiosus, Thermococcus litoralis, and Thermotoga maritima. 

10 Useful thermodegradable polymersases include E. coli DNA polymerase, the Klenow 
fragment of E. coli DNA polymerase, T4 DNA polymerase, T7 DNA polymerase, and 
others. Examples for other polymerizing enzymes that can be used to determine the 
sequence of nucleic acid molecules include E. coli, T7, T3, SP6 RNA polymerases 
and AMV, M-MLV and HIV reverse transcriptases. The polymerase can be bound to 

1 5 the primed target nucleic acid sequence at a primed single-stranded nucleic acid, a 
double-stranded nucleic acid, an origin of replication, a nick or gap in a double- 
stranded nucleic acid, a secondary structure in a single- stranded nucleic acid, a 
binding site created by an accessory protein, or a primed single-stranded nucleic acid. 

The oligonucleotide primer is present in the reagent composition in a 

20 molar excess concentration relative to the nucleotide analog concentrations. The 
oligonucleotide primer anneals to the target region of the PGR amplified genomic 
DNA template. Secondly, a nucleotide analog(s), catalyzed by DNA polymerase, 
extends the oligonucleotide primer by one nucleotide base complementary to the 
template immediately adjacent to the 3' end of the primer thus consuming the 

25 nucleotide(s) from the reagent composition. The present invention provides for the 

identification of the nucleotide analog(s) that is consumed during the primer extension 
reaction by measuring the concentration of unreacted nucleotide analogs remaining in 
the reagent composition solution after primer extension. 

In a preferred aspect of the present invention, after primer extension 

30 and before electrospraying, the extension solution is prepared for mass spectral 

analysis by first passing the reaction solution though a metal chelating resin, and then 
evaporating the effluent so that residual material is taken up in water. In order to 
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maximize the amount of this residual material that dissolves in the water, the samples 
can be subjected to sonication. Sonication is carried out using a sonicator. Typically, 
sonication for a period of 5 to 10 minutes yields adequate sensitivity for mass spectral 
analysis. 

5 Electrospray ionization provides for the atmospheric pressure 

ionization of a liquid sample (Kebaril et al., Electrospray Ionization Mass 
Spectrometry pp. 3-63 (1997), which is hereby incorporated by reference). The 
electrospray process creates highly-charged droplets that, under evaporation, create 
ions representative of the species contained in the solution. When a positive voltage is 

1 0 applied to the tip of the capillary relative to an extracting electrode, such as one 

provided at the ion-sampling orifice of a mass spectrometer, the electric field causes 
positively-charged ions in the fluid to migrate to the surface of the fluid at the tip of 
the capillary. If a negative voltage is applied to the tip of the capillary relative to an 
extracting electrode, such as one provided at the ion-sampling orifice to the mass 

1 5 spectrometer, the electric field causes negatively-charged ions in the fluid to migrate 
to the surface of the fluid at the tip of the capillary. 

When the repulsion force of the solvated ions exceeds the surface 
tension of the fluid being electrosprayed, a volume of the fluid is pulled into the shape 
of a cone, known as a Taylor cone, which extends from the tip of the capillary. A 

20 liquid jet extends from the tip of the Taylor cone and becomes unstable and generates 
charged-droplets. These small charged droplets are drawn toward the extracting 
electrode. The small droplets are highly-charged and solvent evaporation from the 
droplets results in the excess charge in the droplet residing on the analyte molecules in 
the electrosprayed fluid. The charged molecules or ions are drawn through the ion- 

25 sampling orifice of the mass spectrometer for mass analysis. This phenomenon has 
been described, for example, by Dole et al., Chem. Phys. 49:2240 (1968) and 
Yamashita et al., J. Phys. Chem. 88:4451 (1984), which are hereby incorporated by 
reference. The potential voltage required to initiate an electrospray is dependent on 
the surface tension of the solution as described by, for example, Smith, IEEE Trans. 

30 Tnd. Appl. IA-22:527-35 (1986), which is hereby incorporated by reference. 

Typically, the electric field is on the order of approximately 10 6 V/m. The physical 
size of the capillary and the fluid surface tension determines the density of electric 
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field lines necessary to initiate electrospray. Cole, Electrospray Ionization Mass 
Spectrometry: Fundamentals. Instrumentation, and Applications. (1997) summarizes 
much of the fundamental studies of electrospray. Several mathematical models have 
been generated to explain the principals governing electrospray. 
5 U.S. Patent Application Serial Nos. 09/468,535, 09/156,507, 

60/176,605, and 60/210,890, as well as the application entitled "Multiple Electrospray 
Device, Systems, and Methods", naming Gary A. Schultz, Thomas N. Corso, and 
Simon J. Prosser as inventors and filed December 30, 2000 (Express Mail No. 
EL709323020US), which are hereby incorporated by reference, disclose suitable 

10 electrospray devices as well as methods and systems of using electrospray devices to 
prepare a sample for mass spectroscopy. 

The electrospray device used in conjunction with the present invention 
includes a substrate having an injection surface and an ejection surface opposing the 
injection surface. The substrate is an integral monolith having one or more spray 

1 5 units for spraying the fluid. Each spray unit includes an entrance orifice on the 

injection surface, an exit orifice on the ejection surface, a channel extending between 
the entrance orifice and the exit orifice, and a recess surrounding the exit orifice and 
positioned between the injection surface and the ejection surface. The entrance 
orifices for each spray unit are in fluid communication with one another and each 

20 spray unit generates an electrospray of the fluid. The electrospray device also 
includes a first electrode attached to the substrate to impart a first potential to the 
substrate and a second electrode to impart a second potential. The first and the second 
electrodes are positioned to define an electric field surrounding the exit orifice. 

As shown in Figures 2A-B, to generate an electrospray, fluid may be 

25 delivered to the through-substrate channel 2 of the electrospray device 4 by, for 

example, a capillary 6, micropipette or microchip 22. Seal 24 is positioned between 
microchip 22 and electrospray device 4. The fluid is subjected to a potential voltage 
in the capillary 6 or in the reservoir 7 or via an electrode provided on the reservoir 
surface and isolated from the surrounding surface region and the substrate 8. A 

30 potential voltage may also be applied to the silicon substrate via the electrode 10 on 
the edge of the silicon substrate 8 the magnitude of which is preferably adjustable for 
optimization of the electrospray characteristics. The fluid flows through the channel 2 
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and exits from the nozzle 12 in the form of a Taylor cone 14, liquid jet 16, and very 
fine, highly charged fluidic droplets 18. 

The nozzle 12 provides the physical asperity to promote the formation 
of a Taylor cone 14 and efficient electrospray 18 of a fluid. The nozzle 12 also forms 
5 a continuation of and serves as an exit orifice of the through- wafer channel 2. The 
recessed annular region 20 serves to physically isolate the nozzle 12 from the surface. 
The present invention allows the optimization of the electric field lines emanating 
from the fluid exiting the nozzle 12 through independent control of the potential 
voltage of the fluid and the potential voltage of the substrate 8. 

1 0 The present invention also relates to a system that incorporates an 

array of reaction wells, preferably of volume less than 10 uL. The array is preferably 
in the same layout and spacing of standard 96, 384, 1536, and 6,144 well plates, 
although any array is suitable and may be optimized for a given application. The 
reaction wells contain a buffering solution, magnesium acetate, DNA polymerase, 

1 5 amplified target DNA, and SNP primer in a molar excess relative to the 

concentrations of the four ddNTPs (ddCTP, ddTTP, ddATP, and ddGTP) for 
performing SNP primer extension reactions followed by quantification of free 
ddNTPs remaining in each reaction well. 

Another aspect of the present invention relates to a reaction well block 

20 for performing a reaction, such as polymerase chain reaction and primer extension. 
As shown in Figure 3 A, this aspect of the present invention is in the form of an array 
102 of reaction wells 104 formed between plate edges 106 and/of walls 108. Wells 
104, proximate to base 110, contain frit 1 12 or other medium separating the solution 
from the metal chelating resin. Liquid is discharged from wells 104 into entrance 

25 orifice 116, through channel 118, and out of exit orifice 1 20. 

The system incorporates reaction wells with volumes on the order of 
tens of microliters to less than a microliter. The present invention has several 
advantages over other systems disclosed in the prior art. The double- stranded 
amplified target DNA fragment can be added directly to the reaction well array 

30 without prior separation of the strands. The SNP primers can be free in solution, thus 
increasing the reaction probability with the target DNA during the primer extension 
thermal cycles. The SNP primer used for each reaction is also an excess reagent 
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relative to the added amount of each of the ddNTPs, thus effectively improving the 
incorporation efficiency (rate) of the target dideoxynucleotide base(s). The ddNTPs 
are added as a limiting reagent so that the ddNTPs that react and extend the SNP 
primer will be substantially consumed from the reaction solution. The reaction 
5 solution is then passed through a metal chelating resin either on- or off-line to prepare 
the solution for electrospray mass spectrometry analysis. The relative response of the 
four ddNTP bases identifies by which base(s) the SNP primer was extended. 
Heterozygous SNPs can be identified if two ddNTP bases react with the SNP primer. 
In addition, this method can be used for discovery of the known point variation with 

1 0 both tri-allelic and tetra-allelic SNPs. 

Another aspect of the present invention relates to an electrospray 
system. This system includes an electrospray device which comprises a substrate 
having an injection surface and an ejection surface opposing the injection surface. 
The substrate is an integral monolith having an entrance orifice on the injection 

1 5 surface, an exit orifice on the ejection surface, a channel extending between the 

entrance orifice and the exit orifice, and a recess extending into the ejection surface 
and surrounding the exit orifice to define a nozzle on the ejection surface. The 
electrospray system also includes a sample preparation device, as shown in Figure 3 A, 
positioned to transfer fluids to the electrospray device where the sample preparation 

20 device comprises a liquid passage and a metal chelating resin positioned to treat fluids 
passing through the liquid passage. Instead of a metal chelating agent, the sample 
preparation device can have a molecular weight filter positioned to treat fluids passing 
through the liquid passage. 

This electrospray system is shown in Figure 3B and includes array 102 

25 of reaction wells 104 each positioned to discharge liquid into electrospray microchip 
122. In particular, each exit orifice 120 is positioned to discharge liquid into a 
particular receiving well 124 which is formed between edges 126 and/or walls 128. 
After making this transfer, solutions evaporate in receiving wells 1 24 to dryness and 
are subsequently hydrated for controlled discharge. Liquid is discharged from 

30 receiving well 124 through base 130 via entrance orifice 132, channel 134, and exit 
orifice 136. As a result, liquid is discharge from electrospray microchip 122 as an 
electrospray. Preferably, electrospray microchip 122 is positioned in front of an ion- 
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sampling orifice of an atmospheric pressure ionization mass spectrometer for analysis 
of the ddNTPs. 

Another preferred embodiment would interface a microchip-based 
array of separation channels for the detection of ddNTPs with the reaction well array. 
5 The ddNTPs may be separated by liquid chromatography or electrophoretic methods 
and quantified using spectroscopic or conductometric detection. A multi-system chip 
can be fabricated using Micro-ElectroMechanical System (MEMS) technology 
(Schultz et al, Anal Chem 72: 4058-63 (2000), which is hereby incorporated by 
reference) to further provide a rapid sequential chemical analysis system for large- 

10 scale SNP genotyping. For example, the multi-system chip enables automated, 
sequential separation and injection of a multiplicity of samples, resulting in 
significantly greater analysis throughput and utilization of the mass spectrometer 
instrument for high-throughput SNP detection. 

As shown in Figure 3B, liquid is fed into the entire depicted array 102 

15 of reaction wells 104 through conduit 132. A seal 140 is positioned between edge 106 
and conduit 138 to prevent leakage. In addition, as shown Figure 3C, a fluid delivery 
probe 142 is positioned against edges 126 and/or walls 128 by means of seal 144 to 
permit liquid to be charged to the individual receiving wells 124. After each 
receiving well is filled, probe 142 can move sequentially to the next well and fill it. 

20 In a preferred embodiment, the present invention is performed using an 

array of reaction wells. The array of reaction wells is multi-layered. The top layer 
consists of a reaction well. The middle layer has a sample cleanup phase, preferably a 
metal chelating resin, for the removal of magnesium from the reaction mixture. Also, 
a frit and a molecular weight filter may be used. The bottom layer has receiving wells 

25 in fluid communication with nozzles contained on a microchip for generating an 
electrospray of the reaction well product solution. 

Due to its sensitivity and specificity with regard to low molecular 
weight entities, mass spectrometry is preferably used for the detection of these four 
ddNTPs independent of the SNP under evaluation. The mass spectrometry instrument 

30 and detection method is setup to screen any SNP by monitoring four unique ion 

response channels, one for each ddNTP. By use of nanomolar detection sensitivity, 
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the electrospray mass spectrometry method is able to provide a rapid, selective, and 
sensitive method for SNP screening. 

A further aspect of the present invention is directed to a reagent 
composition which includes an aqueous carrier, an oligonucleotide primer, a mixture 
5 of nucleotide analogs of different types, magnesium acetate, a buffer, and a nucleic 
acid polymerizing enzyme. According to this embodiment of the present invention, 
there can be an excess of the oligonucleotide primer to nucleotide analog or there is a 
limited concentration of nucleotide analogs present in the composition. The buffer 
can be ammonium bicarbonate, ammonium acetate buffer, or mixtures thereof. 

1 0 Suitable ranges of these components in the composition are 1 - 1 50 nM of PCR 

product, 1-10 uM of SNP primer, 0.1-10 uM each of the ddATP, ddCTP, ddGTP, and 
ddTTP nucleotide analogs, 1-50 mM NH 4 Ac buffer at a pH of 8.7, 0.5-4 mM 
Mg(Ac) 2 , and 0.1-5 unit of DNA polymerase. Preferred amounts of the components 
are 50 nM of PCR product, 4 uM of SNP primer, 1 uM each of the ddATP, ddCTP, 

1 5 ddGTP, and ddTTP nucleotide analogs, 20 mM NH 4 Ac buffer at a pH of 8.7, 2 mM 
Mg(Ac) 2 , and 1 unit of DNA polymerase. 

EXAMPLES 

20 Example 1 — Mass Spectral Analyses 

By continuously infusing 10 uM ddNTPs at a rate of 10 uL/min into a 
stream of mobile phase flowing at 50 uL/min, electrospray mass spectra of the 
ddNTPs were determined. The cone voltage was 25 V, and the desolvation 

25 temperature was 400°C. The mobile phase consisted of 50/50 methanol/ water with 

0.1% acetic acid. In the mass spectrum, the pseudomolecular ions, (M-H)\ of ddCTP, 
ddTTP, ddATP, and ddGTP appeared at m/z 450, m/z 465, m/z 474, and m/z 490, 
respectively, as shown in Figure 4. In addition, the (M-PO3H2)" ions for each of the 
bases, ddCTP, ddTTP, ddATP, and ddGTP, formed by fragmentation in the source of 

30 the mass spectrometer were observed. Other ions were observed at m/z 79, 
corresponding to PO3", and m/z 159, corresponding to HP 2 06~. 
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The MS/MS product ion mass spectra of the (M-P0 3 H 2 )" ions for each 
of the four ddNTPs was obtained by continuously infusing 10 uM ddNTPs at a rate of 
10 uL/min into a stream of mobile phase flowing at 50 uL/min. The mobile phase 
consisted of 0.1% acetic acid. The (M-PO3H2)" ions were isolated and then 
5 collisionally dissociated using a collision energy of 35 eV. The cone voltage and 

desolvation temperature were maintained at 25 V and 400°C, respectively. The mass 
spectrometer was scanned over the range of 50 m/z to 420 m/z, detecting the product 
ions formed. As shown in Figures 5A-D, product ions were observed at m/z 79, 159 
and 241 for all four bases. 

1 0 Selected reaction monitoring (SRM) is an experiment where the mass 

spectrometer is set up to acquire data for a unique precursor ion to product ion 
transition for mixtures of analytes. This SRM experiment allows for unique signals to 
be obtained on analytes contained in complex mixtures without interference from 
other compounds contained within the mixture. In practice, this firstly involves the 

1 5 isolation of a precursor ion in one region of the mass spectrometer, secondly, focusing 
that ion into a collision cell to cause the ion to fragment and form product ions that 
are related to the molecular structure of the precursor ion. Thirdly, focusing the 
product ions into another region of the mass spectrometer and mass selecting one of 
the product ions formed in the collision cell for detection. 

20 SRM MS/MS mass spectra for the (M-H)" ions collisionally 

dissociated to the common product ion m/z 159 and for the (M- H 2 P0 3 )" ions 
collisionally dissociated to the common product ion m/z 79, respectively, were 
obtained by continuously infusing 10 uM ddNTPs at a rate of 10 (iL/min into a stream 
of mobile phase flowing at 50 uL/min. The mobile phase consisted of 50/50 

25 methanol/water with 0.1% acetic acid. The (M-H)" and (M- H2PO3)" ions for each of 
the four ddNTPs was first isolated and then collisionally dissociated. The product ion 
m/z 159 or m/z 79, common to all four bases, were monitored. The dwell time for 
each transition was 200 msec, the collision energy was 25 eV for (M-H)" and 35 eV 
for (M- H2PO3)", the cone voltage was 25 V, and the desolvation temperature was 

30 maintained at 400°C. For the (M-H)" ions, the SRM transitions monitored were as 
follows: ddCTP, m/z 450.1 -» m/z 159.0; ddTTP, m/z 465.1 -> m/z 159.0; ddATP, 
m/ z 474.1 _> m/z 159.0; ddGTP, m/z 490.1 -> m/z 159.0. See Figure 6A. For the 
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(M- H2PO3)" ions, the SRM transitions monitored were as follows: ddCTP, m/z 370.1 
-> m/z 79.0; ddTTP m/z 385.1 -> m/z 79.0; ddATP, m/z 394.1 -> m/z 79.0; ddGTP, 
m/z 410.1 — » m/z 79.0. See Figure 6B. The ion abundance for each transition was 
represented by the precursor ion, because the product ion m/z 79 and m/z 159 is 
5 common to all four bases. 

Example 2 — Effect of Magnesium Removal 

Well product solutions were evaporated to dryness and reconstituted in 

1 0 a 0.01% acetic acid in methanol solution to demonstrate the importance of removing 
magnesium prior to electrospray mass spectrometry of the reaction well product 
solutions. Figure 7A shows the mass spectrum of a solution of 1 uM ddNTPs (C, T, 
A, G) in 20 mM ammonium acetate pH 8.7, 1 mM magnesium acetate. Note the 
absence of a signal in the mass spectrum for each of the ddNTPs. Figure 7B shows 

1 5 the mass spectrum of this same solution passed through a metal chelating resin based 
on iminodiacetic acid (IDA) functional groups used to complex with metals including 
magnesium. The metal chelating resin removes the magnesium from the solution 
resulting in a measurable signal for each of the ddNTPs as labeled in the mass 
spectrum. Figure 7C shows the mass spectrum of a solution of 1 uM ddNTPs (C, T, 

20 A, G) in 20 mM ammonium acetate pH 8.7 without magnesium acetate and also that 
was passed through the metal chelating resin. Figure 7D shows the mass spectrum of 
a solution of 1 uM ddNTPs (C, T, A, G) in 20 mM ammonium acetate pH 8.7 that has 
only been evaporated to dryness and reconstituted prior to electrospray mass 
spectrometry analysis. Note that there is no difference between the relative ion 

25 intensities for the four ddNTPs of the control experiment in Figure 7D to that in 

Figure 7C, indicating that the IDA metal chelating resin does not adversely adsorb the 
ddNTPs. In Figure 7B, with the presence of magnesium, the measured signals were 
reduced to approximately one-half the control shown in Figure 7D. In the case of 
Figure 7A, where high levels of magnesium were present in the solution, the 

30 formation of ddNTP ions using electrospray was markedly reduced. 
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Example 3 — Optimization of primer extension reaction 



To simplify the primer extension reaction, a synthetic oligonucleotide, 
template A, (5' CCCCTGTATCCTGTGTGAAATTGTTATCCGCTC 3' (SEQ. ID. 
5 No. 1) 33mer) corresponding to the flanking region of the poly-restriction sites of 
pUC18/19 plasmid, was used as a target template. A universal primer #1233 (5' 
AGCGGATAACAATTTCACACAGGA 3 ' (SEQ. ID. No. 5) 24mer) which is a 
complement to the above synthetic template, was used as the SNP primer. The 
reaction was set up in a total volume of 50 pL with 25 mM ammonium acetate buffer 

10 pH 9.3, 1 uM ddNTPs, 2 mM magnesium acetate, 0.1 uM template A, and 1 unit of 
Thermoequenase (Amersham). The #1233 primer was varied at concentrations of 0 
uM, 1 uM, 2 uM, 3 uM, and 4 uM in the reaction for a total of five samples. The 
reaction mixture was subjected to 25 thermal cycles in a GeneAmp PGR System 9700 
(PE Biosystem) with each cycle consisting of 95°C for 30 sec, 60°C for 60 sec, and 

1 5 72°C for 60 sec. The extended reaction samples were passed through Ultrafree-0.5 
filter units (Millipore) and a micro metal chelating column composed of immobilized 
iminodiacetic acid gel (Pierce). The resulting samples were analyzed by electrospray 
ionization coupled to a triple quadrupole Quattro II (Micromass) mass spectrometer 
(ESI-MS/MS). A mobile phase composition of 1 : 1 methanol: water with 0.1% acetic 

20 acid was used at a flow rate of 1 50 pL/min. At least three 1 0 pL injections were 
made for each sample via loop injection into the mobile phase. The mass 
spectrometer was operated in MS/MS selected reaction monitoring (SRJVI) mode for 
each base. The following SRM transitions were monitored for each of the bases: 
ddCTP, m/z 370.1 -> m/z 79.0; ddTTP, m/z 385.1 -> m/z 79.0; ddATP, 

25 m/z 394.1 -> m/z 79.0; ddGTP, m/z 410.1 -> m/z 79.0. 

Example 4 — Determination of Suitable Primer Concentration 

To determine what concentration of primer should be used, an ESI- 
30 MS/MS spectra for the above five samples was determined. In these experiments, the 
extension reaction mixtures each contained 1 uM ddNTPs, 2.5 units of 
Thermosequenase (Amersham), 2 mM magnesium acetate, 25 mM ammonium acetate 
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pH 9.3, 0.1 uM template A (sequence shown in Figure 9), and varying concentrations 
of SNP primer (sequence shown in Figure 9). The concentrations of primer in the 
reactions for Figures 8B, C, D, and E, were 1 uM, 2 uM, 3uM, and 4 uM, 
respectively. The control reaction, shown in Figure 8A, was identical to the reaction 
5 Figure 8D, except that the Thermosequenase was omitted. The primer extension 
reaction consisted of 25 cycles with each cycle composed of a 30 sec denaturing step 
at 95°C, a 60 sec annealing step at 60°C, and a 60 sec extension step at 72°C. The 
extension reaction samples were prepared by filtering with an Ultrafree - 0.5 micron 
filter unit followed by solid phase extraction using an immobilized iminodiacetic acid 

10 gel column. With template A, the SNP base was A. Therefore, following the 
extension reaction, it was expected that the concentration of ddTTP, which 
corresponds to the transition m/z 385.1 -» m/z 79.0, would decrease due to its 
incorporation at the 3' end of the primer. The mass spectral data showed that as the 
primer concentration increased, the consumption of ddTTP in the extension reaction 

1 5 also increased, resulting in a decrease in the ion abundance of transition m/z 3 85 . 1 — » 
m/z 79.0. These data reveal that the optimal primer concentration is 4 uM in the 
primer extension reaction. The relative peak area ratios of the various transitions are 
displayed in Table 1 . 

Table 1. Summary of the Peak Area Ratios of PCR Extension Reaction Samples 
20 which Contained Varying Primer Concentrations. 

Sample Peak Area Ratios 



370/385 



370 / 394 



370/410 



385 /394 



385 /410 



394/410 



1 |iM Primer 

2 |j,M Primer 

3 |dM Primer 

4 yiM Primer 



Control 



0.94 
1.59 
2.48 
5.40 
6.73 



0.95 
0.89 
0.97 
0.90 
0.94 



1.93 
1.83 
1.69 
1.58 
1.65 



1.01 
0.56 
0.39 
0.17 
0.14 



2.05 
1.16 
0.68 
0.30 
0.25 



2.03 
2.05 
1.74 
1.75 
1.76 



Note: 



25 



370 denotes the transition 
385 denotes the transition 
394 denotes the transition 
410 denotes the transition 



m/z 370.1 m/z 79.0 
m/z 385.1 -» m/z 79.0 
m/z 394.1 m/z 79.0 
m/z 410.1 m/z 79.0 
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By mathematically adjusting the relative ratios of the bases for all reactions, it is 
estimated that up to 86% of the initial ddTTP reacted to extend the primer in the 
sample containing 4 jjM SNP primer. 

5 Example 5 — SNP Assay Using Synthetic Oligonucleotides as Homozygous 
Templates. 

To determine the utility of this SNP assay, a model system was 
adopted with one SNP primer (#1233 primer) (SEQ. ID. No. 5) and four synthetic 33- 

1 0 mer templates (SEQ. ID. Nos. 1-4). These four templates differed by only one SNP 
base, A, C, G, or T, as shown in Figure 9. To detect all possible SNP base alterations 
for homozygous cases, 0.1 uM of each of the four templates were used for SNP 
extension reactions under aforementioned conditions. A universal reverse primer 
#1233 (BioLabs) was used for extension. The polymorphic site (A) at position 8 is 

1 5 shown in bold and italics in template A. Other targets including template C, template 
G, and template T were identical to template A except for a C, G, or T at position 8, 
respectively. The ddNTP expected to be consumed in the primer extension reaction 
for each template is indicated. Figure 9 shows the results of SNP genotyping by ESI- 
MS/MS using synthetic single-stranded DNA as target templates. All reactions, 

20 including control samples that did not contain template were run in duplicate. 

To ensure that the technique of the present invention would correctly 
identify the four possible SNP bases, A, C, G, and T, four different templates whose 
sequences are shown in Figure 9, were synthesized. These templates differed from 
one another only by one base at position 8 and were named by this polymorphic base, 

25 so that the same primer could be used in the extension reaction for all four templates. 
The extension reaction mixtures each contained 1 uM ddNTPs, 1 .25 units of 
Thermo sequenase, 2 mM magnesium acetate, 25 mM ammonium acetate pH 9.3, 0.2 
uM template, and 4 uM primer. These reactions differed from one another only by 
the particular template used in each. The control reaction in Figure 10A was identical 

30 to the others except that it did not contain template. The extension reaction was 

thermally cycled for 25 cycles with each cycle composed of a 30 sec denaturing step 
at 95 °C, a 60 sec annealing step at 60°C, and a 60 sec extension step at 72°C. The 
extension reaction samples were prepared for mass spectral analysis by filtering with 
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an Ultrafree - 0.5 micron filter unit followed by solid phase extraction using an 
immobilized iminodiacetic acid gel column. The reaction in Figure 10B contained 
template A which has the SNP base A. Therefore, during the extension reaction, it 
was expected that ddTTP, corresponding to the transition m/z 385.1 ->■ m/z 79.0, 
5 would be incorporated into the primer. The resulting decrease in intensity of the m/z 
385.1 -> m/z 79.0 transition is shown in the reaction of Figure 10B. In the reaction of 
Figure 10C, template C having the SNP base C, was used. Here, it was expected that 
following the extension reaction, ddGTP, corresponding to the transition m/z 
410.1-»m/z 79.0. would be consumed. This was observed in the reaction of Figure 

10 10C, with a significant decrease in the ion intensity of ddGTP, m/z 4l0A\-^m/z 79.0. 
Template G, with SNP base G, was used in extension reaction Figure 10D. A 
decrease in ddCTP, corresponding to the transition m/z 370.1— »m/z 79.0, was 
expected and observed. Finally, the last possible SNP base T, in template T was used 
in the reaction of Figure 10E. Here, it was expected that ddATP, m/z 394.1->m/z 

1 5 79.0, would be incorporated into the primer. A decrease in the ion intensity of m/z 
394.1-»m/z 79.0 was observed in the reaction of Figure 10E. These results show that 
the analysis of the present invention can unambiguously identify the four possible 
bases. The relative peak area ratios of the various transitions are displayed in Table 2. 
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Table 2. Summary of the Peak Area Ratios of PCR Extension Reaction Samples 
Containing Homogeneous Single-Stranded DNA Template. The four templates used 
were named by their polymorphic base. Samples were prepared in duplicate. 



Peak Area Ratios 



Sample 


370/385 


370 / 394 


370/410 


385 /394 


385/410 


394/410 


Control 


1.31 


0.97 


1.40 


0.86 


1.24 


1.44 


Control 


1.03 


0.98 


1.38 


0.95 


1.33 


1.40 


Template A 


11.96 


0.93 


1.26 


0.08 


0.11 


1.35 


Template A 


12.93 


1.07 


1.50 


0.08 


0.12 


1.41 


Template C 


1.24 


1.05 


10.20 


0.85 


8.24 


9.64 


Template C 


1.23 


1.07 


9.77 


0.87 


7.96 


9.18 


Template G 


0.17 


0.16 


0.23 


0.93 


1.36 


1.46 


Template G 


0.19 


0.17 


0.26 


0.89 


1.34 


1.50 


Template T 


1.05 


7.62 


1.52 


7.24 


1.45 


0.20 


Template T 


1.05 


7.20 


1.43 


6.87 


1.36 


0.20 



Note: 

5 370 denotes the transition m/z 370. 1 m/z 79.0 
385 denotes the transition m/z 385.1 -» m/z 79.0 
394 denotes the transition m/z 394.1 -» m/z 79.0 
410 denotes the transition m/z 410.1 -> m/z 79.0 



1 0 Figure 1 1 shows the results from the duplicate set of samples. Both 

Figures 1 0 and 1 1 show identical results with the expected bases consumed by 70- 
80% of their initial concentration. Therefore, this method of SNP analysis provides 
unambiguous identification of all possible single (homozygous) SNP bases. 

15 Example 6 — SNP Assay Using Synthetic Oligonucleotides as Heterozygous 
Templates. 

To mimic the double SNP base changes in heterozygous cases, 
mixtures of the 33-mer templates (SEQ. ID. Nos. 1-4) outlined in Figure 9 were 

20 combined at a concentration of 0.025 uM for each of two templates in the SNP 

extension reactions with the SNP primer (SEQ. ID. No. 5), as shown in Figure 12. 
The results from duplicate sample preparations for the heterozygous cases are shown 
in Figure 13. These samples represent heterozygous cases where two polymorphic 
bases are simultaneously present. These samples represent all possible heterozygous 

25 possibilities. The equal molar mixture of two single-stranded DNA templates, named 
by the SNP bases that were used in previous experiment were also used here. The 
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extension reaction mixtures each contained 1 uM ddNTPs, 1.25 units of 
Thermosequenase, 2 mM magnesium acetate, 25 mM ammonium acetate pH 9.3, 4 
uM primer, and 0. 1 uM each of two different templates. The particular templates 
used in each reaction are provided in Figure 12. The control reaction was identical to 
5 the others except that it did not contain any template. The extension reaction was 
thermally cycled for 25 cycles with each cycle composed of a 30 sec denaturing step 
at 95°C, a 60 sec annealing step at 60°C, and a 60 sec extension step at 72°C. The 
extension reactions samples were prepared for mass spectral analysis by filtering with 
an Ultrafree - 0.5 micron filter unit followed by solid phase extraction using an 

1 0 immobilized iminodiacetic acid gel column. When comparing the reaction of Figure 
13B to the reaction of Figure 13 A, it is apparent that ddTTP, corresponding to the 
transition m/z 385.1 -» m/z 79.0, and ddGTP, corresponding to the transition m/z 
410.1 ->• m/z 79.0, have decreased in intensity. This is consistent with what would be 
expected when polymorphic bases A and C are present as they were in the reaction of 

1 5 Figure 1 3B. Templates A and G were present in the reaction of Figure 13C, and, as 
expected ddTTP, m/z 385.1 -» m/z 79.0, and ddCTP, m/z 370.1 -> m/z 79.0, decreased 
in ion intensity. In the reaction of Figure 13D, the SNP bases are A and T, and the 
corresponding ddTTP, m/z 385.1 -> m/z 79.0, and ddATP, m/z 394.1 -> m/z 79.0, 
were observed to decrease in intensity. The presence of templates C and G in the 

20 reaction of Figure 1 3E, resulted in the expected decrease in ion abundance of ddGTP, 
m/z 410.1 -» m/z 79.0, and ddCTP, m/z 370.1 -> m/z 79.0. In the reaction of Figure 
13F, ddATP, m/z 394.1 -> m/z 79.0, and ddGTP, m/z 410.1 -> m/z 79.0, decreased in 
intensity, corresponding to the expected consumption of ddATP and ddGTP in the 
primer extension reaction in the presence of polymorphic bases T and C. In the 

25 reaction of Figure 13G, templates G and T, corresponding to polymorphic bases, G 
and T, the expected decrease in ion abundance of ddCTP, m/z 370.1 -> m/z 79.0, and 
ddATP, m/z 394.1 -> m/z 79.0 was achieved. This experiment shows that this 
analysis technique can be used to determine the polymorphic bases in heterozygous 
cases. In each sample, both of the bases expected to decrease in concentration did in 

30 fact decrease, with each base consumed by 70-80% of its initial concentration despite 
the fact that only half the amount of each template was added. This result reveals that 
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all possible combinations for heterozygous polymorphisms can be easily identified by 
the method of the present invention and, in addition, that the 25 thermal cycles used 
for the extension reaction are in kinetic excess for efficient incorporation of the free 
ddNTPs. Table 3 lists the peak area ratios for all duplicate samples in the 
5 heterogeneous reactions. 



Table 3. Summary of the Peak Area Ratios of PCR Extension Reaction Samples 
Containing Heterogeneous Single-Stranded DNA Templates. This data mimics 
heterozygous cases. The four templates used were named by their polymorphic base. 
Samples were prepared in duplicate. 



Peak Area Ratios 



Sample 

Control 
Control 

Template A + C 
Template A + C 
Template A + G 
Template A + G 
Template A + T 
Template A + T 
Template C + G 
Template C + G 
Template C + T 
Template C + T 
Template G + T 
Template G + T 



10 Note: 

370 denotes the transition m/z 370.1 ~> m/z 79.0 
385 denotes the transition m/z 385.1 m/z 79.0 
394 denotes the transition m/z 394.1 m/z 79.0 
410 denotes the transition m/z 410.1 m/z 79.0 



370/385 


370/394 


370/410 


385 / 394 


385 /410 


394/410 


0.92 


0.97 


1.28 


1.06 


1.40 


1.32 


0.80 


0.97 


1.19 


1.22 


1.49 


1.23 


8.52 


1.27 


8.79 


0.15 


1.04 


6.94 


9.06 


1.22 


8.63 


0.14 


0.98 


7.08 


1.30 


0.15 


0.21 


0.12 


0.16 


1.38 


1.11 


0.15 


0.21 


0.14 


0.19 


1.37 


7.36 


5.52 


1.46 


0.75 


0.20 


0.27 


7.01 


5.98 


1.70 


0.90 


0.25 


0.29 


0.16 


0.18 


1.13 


1.09 


6.89 


6.32 


0.11 


0.13 


0.85 


1.14 


7.77 


6.78 


1.20 


6.15 


6.63 


5.16 


5.54 


1.10 


1.31 


6.95 


9.52 


5.30 


7.29 


1.40 


0.17 


1.44 


0.30 


8.35 


1.73 


0.21 


0.17 


1.17 


0.26 


6.74 


1.52 


0.23 



15 Using the peak area ratios for all combinations of the four oligonucleotide bases 
allows for the detection of changes in the relative concentrations of the bases. 
Through data analysis, the nature of the SNP locus is readily determined as either a 
homozygous or heterozygous polymorphism. Furthermore, the relative standard 
deviation of the peak area ratio data for each sample and its duplicate, encompassing 

20 six injections was typically less than 15%, suggesting this method of genotyping 
SNPs by detecting free ddNTPs is reproducible. 
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Example 7 — SNP Assay Using Amplified Double-Stranded DNA as Template. 

The model system described previously consisted of a single-stranded 
DNA target sequence. However, from a practical standpoint, double-stranded DNA 
5 will be encountered more often. A potential problem for using double-stranded DNA 
is the reannealing of the two complementary strands that could compete with the SNP 
primer and thereby lower the rate of the extension reaction. To determine whether the 
method of the present invention is applicable to double- stranded DNA, amplified 
double-stranded DNA was used as the template in a primer extension reaction. An 

10 E.coli PheA gene was cloned in pUC18 to make a pJSl plasmid (Zhang et al., J Biol 
Chem 273: 6248-53 (1998), which is hereby incorporated by reference). A 384bp 
portion of partial E.coli PheA gene (SEQ. ID. No. 6) was amplified by regular PGR 
using this pJSl as a template along with W338Ipd (SEQ. ID. No. 7) as the forward 
primer and #1224 (SEQ. ID. No. 8) as the reverse primer. The PCR amplification 

1 5 utilized AmpliTaq DNA polymerase and a GeneAmp PCR System 9700 (PE 

Biosystem). The amplification was performed in 35 thermal cycles with each cycle 
consisting of 95°C for 30 sec, 60°C for 60 sec, and 72°C for 60 sec. The resulting 
PCR product was passed through a Microcon-50 filter unit (Millipore) to isolate the 
384bp template from the residual free dNTPs and primers. The concentrated 384bp 

20 PCR product was then quantified spectrophotometrically (OD260nrn) and used for the 
following extension reaction. 

The extension reaction samples contained 0.05 uM of the 384bp 
double-stranded DNA, 25 mM ammonium acetate buffer pH 9.3, 1 uM ddNTPs, 2 
mM magnesium acetate, and 1 unit of Thermosequenase. Four SNP primers, 

25 W338Ipd, C374Spu, #1224, and C374Apd (SEQ. ID. Nos. 7-10), that are capable of 
annealing to the 384bp target sequence (Pohnert et al., Biochemistry 38: 12212-17 
(1999), which is hereby incorporated by reference, as shown in Figure 14, were used 
in individual reactions at 4 uM concentration. 

Four different primers were used in individual reactions with the same 

30 384 bp double-stranded DNA template. The extension reactions shown in Figures 15 
B to E each contained 1 \xM ddNTPs, 1.25 units of Thermosequenase, 2 mM 
magnesium acetate, 25 mM ammonium acetate pH 9.3, 4 uM primer, and 0.1 uM 384 



bp template. The control reaction was identical to the others except that it did not 
contain any Thermosequenase. The extension reaction was run for 35 cycles with 
each cycle composed of a 40 sec denaturing step at 95°C, a 60 sec annealing step at 
63°C, and a 60 sec extension step at 72 °C. The extension reaction samples were 
5 prepared for mass spectral analysis by filtering with an Ultrafree-0.5 micron filter unit 
followed by solid phase extraction using an immobilized iminodiacetic acid gel 
column. In the reaction shown in Figure 15B, the primer W338Ipd, having the 
polymorphic base T, was used. It was observed from the MS/MS spectrum in Figure 
15B that ddATP, m/z 394.1 -> m/z 79.0, decreased in ion intensity which was 

1 0 expected. The primer C374Spu, was used in the reaction shown in Figure 1 5C. This 
primer has C as its SNP base, so that ddGTP, m/z 410.1 m/z 79.0, was expected to 
decrease in intensity. In the reaction shown in Figure 15C, ddGTP was in fact 
observed to decrease in intensity. In the reaction shown in Figure 15D, primer #1224 
with the polymorphic base G was used. The expected decrease in ddCTP, m/z 370.1 

1 5 m / z 79.0, was observed. The primer C374Apd was used in the reaction shown in 
Figure 1 5E. This primer has the polymorphic base T, and, therefore, it was expected 
that ddATP, m/z 394.1 -» m/z 79.0, would decrease in intensity. This was exactly 
what was observed in the reaction shown in Figure 15E. Consequently, this analysis 
technique works equally well with single and double-stranded DNA. Table 4 shows 

20 the peak area ratios of the bases for the control sample compared to the four different 
SNP primer reactions. 
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370/385 


370 / 394 


370/410 


385 /394 


385/410 


394/410 


0.93 


0.92 


1.50 


0.99 


1.63 


1.65 


0.91 


0.95 


1.40 


1.04 


1.54 


1.48 


1.28 


4.58 


1.99 


3.53 


1.56 


0.44 


1.11 


3.18 


1.81 


2.86 


1.63 


0.57 


1.08 


1.01 


3.10 


0.93 


2.87 


3.09 


0.99 


1.06 


3.77 


1.07 


3.80 


3.56 


0.24 


0.22 


0.39 


0.91 


1.61 


1.78 


0.20 


0.20 


0.34 


1.04 


1.73 


1.66 


1.18 


2.27 


2.06 


1.91 


1.74 


0.91 


0.99 


1.96 


1.63 


1.99' 


1.65 


0.83 



Table 4. Summary of the Peak Area Ratios of PCR Extension Reaction Samples 
Containing Homogeneous Double-Stranded DNA Template. Samples were prepared 
in duplicate. 

Peak Area Ratios 

Sample 

Control - No Enzyme 
Control - No Enzyme 
Primer = W3 3 8Ipd 
Primer = W3 3 8Ipd 
Primer = C374Spu 
Primer = C3 74 Spu 
Primer = #1224 
Primer = #1224 
Primer = C374Apd 
Primer = C3 74 Apd 

Note: 

370 denotes the transition m/z 370.1 -» m/z 79.0 
385 denotes the transition m/z 385.1 -> m/z 79.0 
394 denotes the transition m/z 394.1 m/z 79.0 
410 denotes the transition m/z 410.1 -> m/z 79.0 

By comparing the peak area ratios of the control samples to those samples containing 
enzyme, the SNP bases can be unambiguously identified using double-stranded DNA 
as a template. All expected results, predicted in Figure 15, were observed with each 
base consumed by more than 60%. For example, the primer W338Ipd has the SNP 
base T, and the concentration of only ddATP was found dramatically reduced, as 
shown in Figure 15B, while the other ddNTP bases remained unchanged. Therefore, 
earlier concerns of reannealing of the two complementary DNA strands competing 
with the annealing of the primer are unsubstantiated. Once again, the relative 
standard deviation of each sample and its duplicate was typically less than 15%. 

The above set of reactions was repeated, with the exception that the extension 
reaction samples were not passed through an Ultrafree-0.5 micron filter unit prior to 
treatment with the iminodiacetic acid gel column. This omission in the sample 
preparation process lead to an overall increase in sensitivity. In this set of reactions, 
double-stranded DNA was used as a template. Four different primers were used in 
individual reactions with the same double- stranded DNA 384bp template. The 
extension reactions shown in Figures 16 B-E each contained 1 uM ddNTPs, 1.25 units 
of Thermosequenase, 2 mM magnesium acetate, 25 mM ammonium acetate pH 9.3, 4 
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uM primer, and 0.1 jaM 384bp template. The control reaction was identical to the 
others except that it did not contain any Thermosequenase. The extension reaction 
was run for 35 cycles with each cycle composed of a 40 sec denaturing step at 95°C, a 
60 sec annealing step at 63 °C, and a 60 sec extension step at 72°C. The extension 
5 reaction samples were prepared for mass spectral analysis simply by solid phase 

extraction (SPE) using an immobilized iminodiacetic acid gel column. In the reaction 
shown in Figure 16B, the primer W338Ipd, having the polymorphic base T, was used. 
It was observed from the MS/MS spectrum in Figure 16B that ddATP, m/z 394.1 -> 
m/z 79.0, decreased in ion intensity which was expected. The primer C374Spu, was 

1 0 used in the reaction of Figure 1 6C. This primer has C as its SNP base, so that ddGTP, 
m/z 410.1 -> m/z 79.0, was expected to decrease in intensity. In the reaction shown in 
Figure 16C, ddGTP was in fact observed to decrease in intensity. In the reaction 
shown in Figure 16D, primer #1224 with the polymorphic base G was used. The 
expected decrease in ddCTP, m/z 370.1 -> m/z 79.0, was observed. The primer 

1 5 C374Apd was used in reaction shown in Figure 1 6E. This primer has the 

polymorphic base T, and, therefore, it was expected that ddATP, m/z 394.1 m/z 
79.0, would decrease in intensity. This was exactly what was observed in the reaction 
shown in Figure 16E. In this set of reactions, it was determined that filtering prior to 
the SPE treatment was not necessary and that higher sensitivity was obtained for 

20 extension reaction samples that are not filtered. The peak area ratio results of the data 
shown in Figure 16 is summarized in Table 5. 
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Table 5. Summary of the Peak Area Ratios of PCR Extension Reaction Samples 
Containing Homogeneous Double-Stranded DNA Template. These samples were not 
filtered before treatment with IDA columns. 

Peak Area Ratios 



Sample 370/385 370/394 370/410 385 /394 385/410 394/410 

Control 0.59 1.11 0.93 1.88 1.58 0.84 

Primer = W3 3 8Ipd 0.79 51.13 1.42 64.87 1.79 0.03 

Primer = C374Spu 0.69 1.35 3.33 1.94 4.79 2.47 

Primer = #1224 0.10 0.20 0.18 1.98 1.79 0.90 

Primer = C374Apd 0.71 6.82 1.31 9.55 1.84 0.20 



Note: 

5 370 denotes the transition m/z 370. 1 -» m/z 79.0 
385 denotes the transition m/z 385.1 -» m/z 79.0 
394 denotes the transition m/z 394.1 -> m/z 79.0 
410 denotes the transition m/z 410.1 — > m/z 79.0 



1 0 Example 8 — Detection of pheA Gene Mutations 

A 384bp PCR product of partial pheA gene with a C374A mutation 
(SEQ. ID. No. 13) was constructed by site-directed mutagenesis and amplified by 
PCR amplification with a mutagenic primer, W338Ipd primer (SEQ. ID. No. 7), as 

15 forward primer, #1224 primer (SEQ. ID. No. 8) as reverse primer, and pSZ87 plasmid 
as a template. The pSZ87 plasmid containing the C374A mutation in the parent 
vector pJSl was constructed as described (Pohnert et al., Biochemistry 38: 12212-17 
(1999), which is hereby incorporated by reference). The sequence of the double- 
stranded 384bp-C374A mutant PCR product is shown in Figure 17, in which three 

20 site-directed mutated bases are shown in italics. The sequence of two amplification 
primers and two polymorphic detection primers are included. For each primer, the 
primer binding site to one or the other strand of the target DNA sequence is indicated 
by a line, and the direction of DNA synthesis is indicated by an arrow. The 
polymorphic bases for each detection primer are listed and the complementary bases 

25 in the target sequence for each detection primer is shown in bold. An equal molar 

mixture of 384bp wild type (SEQ. ID. No. 6) and C374A mutant DNA (SEQ. ID. No. 
13) is used as a template to further demonstrate this method for detection of 
heterogeneous polymorphic bases. To identify two SNP bases in the heterogeneous 
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reactions, two additional SNP primers, T366pd (SEQ. ID. No. 1 1) and V383pu (SEQ. 
ID. No. 12) were synthesized and used for the heterogeneous assay as shown in 
Figure 17. 

In this set of reactions, T366pd was used as the primer. Two different 
5 3 84 bp DNA templates were used. The extension reactions each contained 1 uM 
ddNTPs, 1 .25 units of Thermosequenase, 2 mM magnesium acetate, 25 mM 
ammonium acetate pH 9.3, 4 uM T366pd primer, and 0.12 uM 384bp template. The 
control reaction was identical to the others except that it did not contain any 
Thermosequenase. The results for this reaction are shown in Figure 18A. The 

1 0 extension reaction was run for 35 cycles with each cycle composed of a 40 sec 

denaturing step at 95°C, a 60 sec annealing step at 63 °C, and a 60 sec extension step 
at 72°C. The extension reaction samples were prepared for mass spectral analysis 
simply by solid phase extraction using an immobilized iminodiacetic acid gel column. 
Filtering prior to the SPE treatment was not performed. In Figure 1 8B, wild type 

1 5 384bp DNA was used as the template, and, consequently, the polymorphic base was 
A. The results in Figure 18B indicate that the expected consumption of free ddTTP 
occurred. Figure 18C shows the resulting mass spectrum from a reaction with C374A 
mutant DNA template. In this example, C becomes the SNP base, and the expected 
decrease in intensity of ddGTP was observed. Figure 1 8D shows the resulting mass 

20 spectrum when both templates were added in an equal molar ratio such that the 

combined concentration of DNA template remained 0.12 uM. This situation closely 
resembled any heterozygous case that could be encountered. Both polymorphic bases 
A and C were present in this mixture. The SRM MS/MS mass spectrum of the 
remaining free ddNTPs after the PCR extension reaction showed that the ion current 

25 for both ddTTP and ddGTP decreased in intensity, as predicted. It was calculated that 
ddTTP and ddGTP were consumed approximately 48% and 38%, respectively. 
Consequently, this analysis technique can unambiguously identify the polymorphic 
bases in double-stranded DNA for both homozygous and heterozygous cases. 

The above reaction steps were repeated with V383pu being used as the 

30 primer. Two different 384 bp DNA templates were used. In Figure 19B, wild type 

384bp DNA was used as the template, and, consequently, the polymorphic base was T 
and the expected consumption of free dd ATP occurred. Figure 1 9C shows the 
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resulting mass spectrum from a reaction with C374A mutant DNA template. Here, C 
became the SNP base, and the expected decrease in intensity of ddGTP was observed. 
Figure 1 9D shows the resulting mass spectrum when both templates were added in an 
equal molar ratio such that the combined concentration of DNA template remained 
5 0.12 uM. This situation closely resembled any heterozygous case that could be 
encountered. Both polymorphic bases T and C were present in this mixture. The 
SRM MS/MS mass spectrum of the remaining free ddNTPs after the PGR extension 
reaction showed that the ion current for both ddATP and ddGTP decreased in 
intensity, as predicted. It was calculated that ddATP and ddGTP were consumed 
10 approximately 42% and 32 %, respectively, in this reaction. Consequently, this 
analysis technique can unambiguously identify the polymorphic bases in double- 
stranded DNA for both homozygous and heterozygous cases. 

A summary of the mean peak area ratios and standard deviations for 
the results shown in Figure 1 8 and Figure 19 are listed in Table 6. 
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In addition, the mean + standard deviation is provided for several other 
samples studied. Table 7 lists the mathematically normalized percent of free 
dideoxynucleotide bases remaining in solution following primer extension reactions 
for the results shown in Figure 18 and Figure 19. 

5 Table 7. Mathematically normalized percent of free dideoxynucleotide bases 
remaining following primer extension reactions shown in Figures 16 and 17. 

Sample (3 84bp template-primer) Consumed bases n Mean±SD 



Homogeneous template 



Wild type-T366pd 


ddTTP 


9 


23.0 ±2.5 


Wildtype-V383pu 


ddATP 


9 


21.4 ±4.5 


C374Amutant-T366pd 


ddGTP 


9 


32.4 ± 4.2 


C374A mutant V383pu 


ddGTP 


9 


30.1 ±5.1 


Heterogeneous template 








Wild type+C374A mutant-T366pd 


ddTTP 


6 


48.1 ±4.6 




ddGTP 


6 


37.7 ±3.3 


Wild type+C374A mutant-V383pu 


ddATP 


6 


42.2 ±7.9 




ddGTP 


6 


31.8±4,5 



These results clearly demonstrate the feasibility of using ESI-MS/MS 
for SNP genotyping by monitoring unreacted dideoxynucleotides remaining in the 
1 0 solution and provide evidence that any known SNP can be analyzed by this technique. 

Example 9 - Dependence of ddNTP Consumption on Template Concentration 
and Number of Thermal Cycles 

1 5 To optimize the primer extension conditions for the most efficient 

incorporation of ddNTPs into the SNP primer, a series of primer extension reactions 
were performed varying both the single-stranded template A (SEQ. ID. No. 1) 
concentration from 5 to 100 nM and the 384bp double-stranded DNA (SEQ. ID. No. 
6) concentration from 5 to 150 nM. In addition to varying the template concentration, 

20 the number of thermal cycles was varied between 10 and 60 cycles for every 
concentration of template. 
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For the single- stranded DNA experiment, template A (5 ' 
CCCCTGTATCCTGTGTGAAATTGTTATCCGCTC 3', SEQ. ID. No. 1), 
corresponding to the flanking region of the poly-restriction sites of pUC 18/19 
plasmid, was used as a target template. The concentration of template was varied at 0 
5 nM, 5 nM, 10 nM, 25 nM, 50 nM, 75 nM, and 100 nM. The universal primer #1233 
(SEQ. ID. No. 5) which is a complement to the above synthetic template, was used as 
the SNP primer at a concentration of 4 uM. The reaction was set up in a total volume 
of 50 |j,L, which in addition to the template and primer, was composed of 25 mM 
ammonium acetate pH 9.3, 1 uM of each ddNTP, 2 mM magnesium acetate, and 

10 1 unit of Thermosequenase. The reaction mixture was subjected to 10, 20, 30, 40, 50, 
or 60 thermal cycles with each cycle consisting of 95°C for 30 sec, 60°C for 60 sec, 
and 72°C for 60 sec. The extension reaction samples were prepared for mass spectral 
analysis by solid phase extraction using an immobilized iminodiacetic acid gel 
column. The results are displayed in Figure 20A. 

1 5 For the double-stranded DNA experiment, a 384bp PCR product of pheA 

partial gene (SEQ. ID. No. 6) was used as the template at concentrations of 0 nM, 5 
nM, 10 nM, 25 nM, 50 nM, 100 nM, or 150 nM. In addition to the template, the 
reaction mixture also contained 4 jjM of T366pd SNP primer (SEQ. ID. No. 1 1), 1 
uM of each ddNTP, 25 mM ammonium acetate pH 9.3, 2 mM magnesium acetate, 

20 and 1-2 units of Thermosequenase. The 50 uL reaction mixture was thermally cycled 
10, 20, 30, 40, 50, or 60 times at 95°C for 30 sec, 63°C for 60 sec, and 72°C for 30 
sec. The extension reaction samples were prepared for mass spectral analysis by solid 
phase extraction using an immobilized iminodiacetic acid gel column. The results are 
displayed in Figure 20B. 

25 In the primer extension reaction, both the template concentration and the 

number of thermal cycles are important for adequate incorporation of free ddNTPs 
into unextended primers. It was determined through these optimization studies that 
there is a large difference in the ddNTP incorporation rate between extension 
reactions containing single-stranded DNA template and those containing double- 

30 stranded PCR product as template. When single-stranded DNA was used as a 

template, the following cases permitted the ddNTP to be consumed by at least 30% in 
the primer extension reaction, thereby allowing the genotype to be scored accurately 
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by ESI/MS: 10 nM template for 20 cycles, 20 nM template for 10 cycles, or 5 nM for 
30 cycles. These results are shown in Figure 20 A. Figure 20B shows that when 
double-stranded DNA was used as template, 5 nM template for 30 cycles permits 
accurate scoring. 

5 The primer extension efficiency is lower when double-stranded DNA template 

is present in the primer extension reaction than when single-stranded DNA template is 
present, as displayed in Figure 20. This can be explained by considering the 
competition that takes place in a primer extension reaction containing double-stranded 
DNA template arriving between the SNP primer and the complementary strand to 

10 hybridize to the template strand. When only single-stranded template is present, the 
competition is non-existent and, consequently, the primer extension efficiency is 
higher. This competition is the reason for which the maximum incorporation 
efficiency is obtained at 50 nM of double-stranded DNA template, using the extension 
conditions provided. At higher concentrations of double- stranded DNA, the excess 

1 5 template results in self-annealing of the template being more probable than the 

hybridization of the SNP primer to one stranded of template. As one would expect, 
increasing the SNP primer concentration from 4 uM to 6 uM increases the 
incorporation efficiency of reactions containing a high concentration of double- 
stranded DNA template. Although the incorporation efficiency for double-stranded 

20 DNA template is lower than for single- stranded PCR product, the primer extension 

efficiency was sufficient for a SNP base to be accurately assigned using only 5 nM of 
double-stranded template. Since typical PCR amplifications produce from 10" 8 M to 
10" 7 M of PCR product (Mathieu-Daude et al., Nucleic Acids Res 24: 2080-6 (1996), 
which is hereby incorporated by reference), the ESI/MS-based SNuPE assay can 

25 confidently and unambiguously assign a SNP base from double- stranded DNA 
template using 20 to 30 primer extension thermal cycles. 

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without departing from the spirit and 

30 scope of the invention which is defined by the following claims. 



